Transformation-based tree-to-tree alignment

نویسنده

  • Gideon Kotzé
چکیده

Previous experiments suggest that a rule-based approach to tree alignment error correction serves to be an effective complement to statistical alignment. We show how, using relatively few features, an implementation of Brill’s Transformation-Based Learning algorithm improves the results of a high precision model of the statistical aligner Lingua-Align. Using our system to correct already tree aligned data, we achieve balanced F-scores of 80.6 on our test set and 85.2 on our development test set. Using it as a tree aligner on word aligned data, our best F-scores using the same model amount to 78.7 and 83.0 respectively. Finally, we apply a pipeline of alignment and error correction tools to create several versions of a large parallel treebank consisting of various domains for Dutch to English for use in a syntax-based MT system. We conclude that transformation-based learning is a promising approach for the large-scale creation of parallel treebanks for various NLP purposes.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Thomas Hardy's Under the Greenwood Tree : An Althusserian Perspective

With the beginning of the 19th century, England entered into a transitional period. Sociologists believe that in each era of transformation, the ruling class tries to establish its own values, but some resistance to these new values is inevitable. Thomas Hardy's novels in general, and Under the Greenwood Tree specifically, are no exception. Based on these notions, this paper tries to interpret ...

متن کامل

Determining Difference in Evolutionary Variation of Bacterial RecA proteins vs 16SrRNA Genes by using 16s_Toxonomy Tree

Background and Aims: The rate of variation in various genes of a bacterial species is different during evolution. Therefore, in systematic bacterial studies many researchers compare the phylogenetic tree of a particular gene to the standard tree of an rRNA gene. Regarding the importance of 16SrRNA gene and the evolutional process of RecA protein family, we investigated the changes in the select...

متن کامل

Predicting Twist Condition by Bayesian Classification and Decision Tree Techniques

Railway infrastructures are among the most important national assets of countries. Most of the annual budget of infrastructure managers are spent on repairing, improving and maintaining railways. The best repair method should consider all economic and technical aspects of the problem. In recent years, data analysis of maintenance records has contributed significantly for minimizing the costs. B...

متن کامل

Performance Metrics and Their Extraction Methods for Audio Rendered Mathematics

We introduce and compare three approaches to calculate structureand content-based performance metrics for user-based evaluation of math audio rendering systems: Syntax Tree alignment, Baseline Structure Tree alignment, and MathML Tree Edit Distance. While the first two require “manual” tree transformation and alignment of the mathematical expressions, the third obtains the metrics without human...

متن کامل

Restoration of the Mechanical Axis in Total Knee Artrhoplasty Using Patient-Matched Technology Cutting Blocks. A Retrospective Study of 132 Cases

Background: The aim of this study is to evaluate the accuracy of bone cuts and the resultant alignment, using theMyKnee patient specific cutting blocks.Methods: We retrospectively reviewed 132 patients undergoing primary TKR for osteoarthritis by one single surgeon.The operative time, the preoperative Hip-Knee-Ankle (HKA) axis based on the CT-scan, the postoperative HKA axisbased on long axis s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013